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Mass lesions are one of the breast cancer tumors. Mammogram images are 
the first screening tool to detect tumors in the women breast, but due to 
radiologist fatigue, number of false positive (FP) and false negative (FN) 
rates are increased. The main objective of this paper is to develop an 
intelligent computer aided diagnosis (CAD) system that can accurately 


detect mass lesions in digitized mammogram images. The proposed method 


has three stages. The first stage is a preprocessing stage, where the mass 
Keywords: lesion is enhanced using a customized Laplacian filter. Then, multi- 
statistical filters are implemented to detect a potential mass lesion in the 
mammogram images. In the final stage, the number detected FP regions are 
: reduced using five texture features. The proposed algorithm is evaluated 
Mass lesion using 45 mammogram images and the algorithm achieved an accuracy rate 
Statistical filter of 97% in detecting mass lesion with 83% sensitivity rate and 98% 
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1. INTRODUCTION 

One of the most common causes of death in women is the breast cancer. In the past years, number of 
breast cancer cases are increased and specially in young women [1]. Statistically, India nowadays has the 
larger breast cancer mortality in the world [2]. Breast cancer can be detected using different modalities such 
as mammography machine, positron emission tomography (PET), magnetic resonance imaging (MRI), and 
ultrasonography. 

Mammography machine is the most reliable and economic as a first stage in detecting breast cancer 
[3]. Mammogram images can screen mainly three types of breast cancer tumors which are masses, 
architectural distortion, and bilateral asymmetry of the breast [4]. In mammogram images, the radiologist can 
distinguish between benign and malignant mass lesion based on their shape, margin, and density properties 
[5]. Masses that are round in shape, oval and slightly labiate shape are considered as a benign masse region. 
Whereas, irregular shape, and multi lobular mass may considered as malignancy mass region [6]. Margin is 
another way to distinguish between benign and malignant regions, where the detected region is considered as 
benign when it is circumscribed margin but the spiculated or micro lobulated margins are highly considered 
as malignancy regions [7]. 

A computer aided diagnosis (CAD) platform has been used to help radiologists in detecting mass 
lesion accurately. Therefore, many of the authors investigate different intelligent techniques in enhancing the 
performance of CAD system by increasing sensitivity of detecting tumors [8]. Detection of mass clusters in 


Journal homepage: http://beei.org 


2924 A ISSN: 2302-9285 


mammogram images is more difficult than microcalcification since they are highly connected to surrounding 
parenchymal tissues and they are usually surrounded by non-uniform tissue background with similar 
characteristics [9], [10]. Therefore, in this paper, the authors develop a novel CAD system that can accurately 
detect the mass lesion in the mammogram images. The proposed method is divided into three main stages. In 
the first stage, the mammogram image is preprocessed using Laplacian filter. The preprocessing process will 
enhance the contrast of mass region in mammogram image. After that as a second stage, the multi-statical 
filters are implemented to detect potential mass lesion clusters (PMC). Finally, five texture features are 
implemented to reduce the detected false positive (FP) regions. The paper is organized as follows. Literature 
review is presented in section 2. Section 3 explains the proposed method. Section 4 discusses the 
experimental results and section 5 concludes the paper. 


2. LITERATURE REVIEW 

Many authors developed different techniques to enhance the performance of CAD system such as 
[11]. The paper used classification and extreme learning machine (ELM) classifier to detect tumors in 
mammogram images. The algorithm used five textural and morphological features in detecting regions of 
interest. Then support vector machine (SVM) is implemented for classification process. Other authors used 
texture features in CAD system who are [12]. Where Zirinke moments are extracted from the ROI of the 
images. Then, combination between Zirinke of moments and texture features are implemented to classify FP 
and true positive (TP) regions. The classification accuracy for their algorithm reaches 94.11%. Genetic 
algorithm was another tool to accurately detect mass lesions in mammogram images such [13]. They 
accurately classify regions using genetic algorithm by 10.53% comparing with traditional multi-layer 
perceptron neural network (MLP-NN). Contourlet transform algorithm and SVM is proposed by Gedik [14] 
to classify mammogram images. The accuracy for their algorithm reaches 98.467%. Moreover, Hu et al. [15] 
proposed adaptive thresholding segmentation technique. They combine both local threshold and an adaptive 
global threshold segmentation in detection tumors in mammogram images. The algorithm sensitivity was 
91.3% with 0.71 FP per image. According to Surinderan and Vadivel [16], shape features have also been 
used in classification mass lesions. They used classifier and regression tree (CART) classifier to accurately 
detect mass lesions in digitized mammogram images. The proposed classifier successfully detects 93.62%. 
Minavathi et al. [17] proposed another contour algorithm to detect mass lesion. They measure curvature 
angle of each pixel connected in mass lesion boundaries. Their method detect the TP regions with accuracy 
92.7% sensitivity with 0.88. Genetic programming (GP) filter is proposed by Uppal [18] to enhance the 
performance to the classifier of CAD system. They customized a new GP filter to accurately detect mass 
lesion and they successfully detected them with 96.97% with 98.39% sensitivity and 94.59% specificity. 
Mehdy et al. [19] proposed artificial neural network (ANN) to detect mass lesions in mammogram image. 
They investigate different types NN and hybrid NN like SOM to build a novel classifier to detect mass 
lesions. Wang et al. used NN [20] to detect mass lesion. They used convolutional neural network (CNN) to 
which the input consisted of large image window for computerized detection of clustered microcalcifications. 
The proposed CNN classifier has an accuracy rate of 97.1% in detecting mass lesions. Also, Abdel-Zaher and 
Eldeib [21] proposed an intelligent classifier using supervised NN in detection breast cancer. Finally, latent 
dirichlet allocation (LDA) model classifier was proposed by Wang et al. [22] to detect breast cancer. The 
classifier has achieved an accuracy 92.74% in detecting tumors in mammogram images. 


3. METHOD 

The purpose of this section is to discuss the methodologies used to detect and enhance the contrast 
of a mass lesion on mammographic images. According to Figure 1, the proposed algorithm consists of three 
main stages. In the first stage, Laplacian filter is used to enhance the contrast of the mass lesion. Next, a 
potential mass lesion region is detected. As a final step, a texture feature algorithm is applied to mammogram 
images to reduce the total number of detected false positive regions. 


3.1. Pre-processing stage 
As a first stage, the contrast of the mass lesion is enhanced as a prepossessing stage. Laplacian filter 
is one of simplest and most effective filter in enhancing the intensity of mammogram images as in (1): 


Sy) = f(x,y) + c[V* f(x, y)] (1) 


where S(x, y) is the intensity value for the processed image, f(x, y) the intensity value for the input image and 
c is consider as one in this paper. 
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The appearance characteristics of mass lesion is a small white region with diameter less than 50 mm 
appears in the mammogram image [23]. Also, the intensity value of the mass lesion is higher than all the 
surrounding regions (background). So, mass lesion looks like a peak corresponding to surrounding regions. 
Therefore, 45 mammogram images are tested to find intensity value for center of the mass lesions and 
intensity of surrounding regions. It was found that the range between mass lesion and surrounding area is 
between 40 to 250 grey levels. In accordance with these observations, each mammogram image is processed 
using the modified average filter that represent on (2): 


1 
Sk = — toh (2) 


Where S; is the intensity value for the processed image, r; the intensity value for the input image and m and n 
are the mask size. Both Laplcian and modified average filters are implemented on 45 mammogram images. 
The result shows that the contrast of mass lesion is slightly improved as shown in Figures 2(a) and (b). 


Detection Potential Mass Clusters 


Extraction Texture Features 


Processed Image 


(a) (b) 
Figure 1. Mass detection and Figure 2. Mass contrast enhancement in mammogram image 
classification stages (a) original image and (b) enhanced mass image 


3.2. Detection potential mass cluster stage 

As presented before, the visual characteristics of mass lesion is a small white region in range of 25 
to 50 mm [24]. To detect a PMC, two concentric circular masks were designed as shown in Figure 3. So, 
when average the inner mask is greater than the average of the outer mask this will be considered as a PMC. 
The size of both inner and outer masks are design based on the image resolution of both databases 
(University of South of Florida (USF) and mammographic image analysis society (MIAS) databases) which 
are 45x45 um [25]. This concentric circular mask has been tested on 45 mammogram images and it was 
found that all mass lesions are detected but with high number of FP regions. The PMC algorithm is designed 
based on the fact that intensity value of mass lesion is higher than the surrounding regions as shown in 
Figure 4. Therefore, when the average of inner mask is greater than the average of outer mask it will be 
considered as a PMC. 

PMC algorithm is implemented on all mammogram images. All mass lesions are accurately detected 
but unfortunate with a very high number of detected FP regions. This huge number of detected FP regions 
will decrease the sensitivity of our proposed CAD systems as shown in Figures 5(a) and (b). So, we need 
another stage to reduce the number of detected FP regions which will be presented in the next section. 


Figure. 3. Two concentric circular masks 
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Figure 4. The mesh grid of mass 


(a) (b) 


Figure. 5. PMC processing (a) the original dense breast image and (b) the processed dense breast image 


3.3. Texture feature extraction 

Mass lesions are accurately detected using the PMC algorithm but with a high number of detected 
FP regions. So, further stage is needed to reduce the number of detected FP regions. In this paper, texture 
features extraction technique is used to reduce the number of detected FP regions. 

Initially, proper samples of malignant and benign datasets were collected from 45 mammogram images. 
Clusters of size 21x21 are manually cropped from these mammogram images. Therefore, a dataset of 611 
expertly identified as an actual TP clusters and 906 an actual FP cluster were collected and organized in two 
categories (TP and FP clusters). In this paper the first order texture features base are calculated from the intensity 
histograms renormalized to give the histogram probability P(i) defined in (3) for each actual TP and FP cluster: 


P(i) = Diss h(D)/MN (3) 


where h(i) is the intensity histogram and M, N are the image region’s height and width respectively. 

Then five features are calculated and modified comparing with AbuBaker [26] by considering the 
upper and lower grey levels of the mass in the mammogram images as shown in (4) to (8). 
a. The modified mean feature (u): 


H = Xito iP (Ù) (4) 
b. The modified entropy feature (E) 
E = — X210 PW) log2[ PW] (5) 


c. The modified standard deviation feature (o) 
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o= [NPG ar PO (6) 


d. The modified third order of moment feature (M3) 
M; = Yis4o(i — H)° PC) (7) 
e. The modified kurtosis feature (K) 


K = 0 * yo — w)*P(i) — 3 (8) 


4. RESULTS AND DISCUSSION 

The proposed smart classifier was evaluated using 45 mammogram images from two databases 
(USF and MIAS database). At the beginning, the PMC algorithm is implemented to accurately detect mass 
lesion in mammogram images. Then a smart classifier that used five texture features (entropy, mean, standar 
deviation (STD), kurtosis, and skewness) was implemented to reduce the number of detected FP clusters. The 
results were compared with different author’s results as shown in Table 1. 

It is also worth mentioning that the TP and FP rates in these publications are reported for different 
mammogram images and using different benchmarks. In addition, most of these algorithms are tested on the 
MiniMIAS database, which contains smaller images, mainly 1,024x1,024, compared to the MIAS images we 
have used for evaluating our algorithm, which range from 623x1,774 to 1,269x2,132. Hence, we have 
conducted additional evaluations as described. 


Table 1. The comparison results using other author’s algorithms 


Algorithm TP FP 
Brijesh and Ping [27] 85 NA 
Linguraru et. al. [28] 91 0.95 per image 
Peng et. al. [29] KD-GA 98.9 40% FPF 
Peng et. al. [29], GA 85 20% FPF 
Zhang et. al. [30] discriminate 70 NA 
Zhang et. al. [30], logistic regression 70 NA 
Sentelle and Sutton [31] 94 17 per image 
Malar et. al. [32] 94 NA 
Mohanalin et. al. [33] 96.55 0.4 per image 
Oliver et. al. [34] 80 1 per image 
Rizzi et. al. [35] 98 1 per image 
Ravi et. al. [36] 91 1.63 per image 
The proposed algorithm 97.6 0.6 per image 


From Table 1, it is clearly noticed that performance of the proposed algorithm is better than other 


algorithms in both TP and FP clusters. The algorithm achieved an accuracy rate around 97.6% with very low 
FP regions which is 0.6 per image. Figures 6(a)-(d) show some images with results from the mass detection 
algorithm superimposed. 


(b) (c) (d) 


Figure 6. Accurate detection of mass lesions (a) original fatty breast, (b) processed image, (c) original dense 
breast, and (d) processed image 
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5. CONCLUSION 

A new novel technique to accurately detect and classify the mass lesion in mammogram images is 
proposed in this paper. Initially, the contrast of mass lesion is enhanced using a customized Laplacian filter. 
The enhancement technique is slightly enhancing the contrast on mass lesion and it become more brighter 
than the surrounding regions. After that, two concentric circular masks were implemented to detect all the 
peaks in the mammogram images. These detected peaks are considered as PMC since all mass lesions are 
detected. Finally, five texture features were used to reduce the number of detected FP regions. The algorithm 
is evaluated by processing 45 mammogram cases from two databases (MIAS and USF database). In all cases 
the algorithm can successfully detect the mass lesion with an accuracy rate of 97.6% with minimum number 
of FP region 0.6. 
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